Tibetan text classification based on RNN
نویسندگان
چکیده
Abstract In this paper, a deep learning RNN model is used to classify Tibetan texts. The core idea first preprocess the news corpus, and then use syllables construct syllable table based on lexical grammatical structure of Tibetan, embed in sentence, represent each as fixed Numerical vector. Secondly, cyclic neural network constructed. First, text different lengths filled or truncated into sequence length uniform length. For input text, vector representation time step train model. test samples were evaluate accuracy classification by introducing recall rate, precision rate F-test. Finally, compared with traditional machine Logistic algorithm, polynomial naive Bayes algorithm KNN results show that has better effect.
منابع مشابه
Tibetan Text Clustering Based on Machine Learning
Tibetan information processing technology has been obtained some achievements. But it falls behind Chinese and English information processing. It still needs to be paid more attention. Text clustering has the potential to accelerate the development of Tibetan information processing. In this paper, we propose an approach of Tibetan text clustering based on machine learning. Firstly, the approach...
متن کاملClassification-based RNN machine translation using GRUs
We report the results of our classification-based machine translation model, built upon the framework of a recurrent neural network using gated recurrent units. Unlike other RNN models that attempt to maximize the overall conditional log probability of sentences against sentences, our model focuses a classification approach of estimating the conditional probability of the next word given the in...
متن کاملChain Based RNN for Relation Classification
We present a novel approach for relation classification, using a recursive neural network (RNN), based on the shortest path between two entities in a dependency graph. Previous works on RNN are based on constituencybased parsing because phrasal nodes in a parse tree can capture compositionality in a sentence. Compared with constituency-based parse trees, dependency graphs can represent relation...
متن کاملBLSTM-RNN Based 3D Gesture Classification
This paper presents a new robust method for inertial MEM (MicroElectroMechanical systems) 3D gesture recognition. The linear acceleration and the angular velocity, respectively provided by the accelerometer and the gyrometer, are sampled in time resulting in 6D values at each time step which are used as inputs for the gesture recognition system. We propose to build a system based on Bidirection...
متن کاملOn Compression-Based Text Classification
Compression-based text classification methods are easy to apply, requiring virtually no preprocessing of the data. Most such methods are character-based, and thus have the potential to automatically capture non-word features of a document, such as punctuation, word-stems, and features spanning more than one word. However, compression-based classification methods have drawbacks (such as slow run...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Physics: Conference Series
سال: 2021
ISSN: ['1742-6588', '1742-6596']
DOI: https://doi.org/10.1088/1742-6596/1848/1/012139